NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections

Lihan, Zha; Cui, Yuchen; Lin, Li-Heng; Kwon, Minae; Gonzalez, Montserrat; Zeng, Andy; Xia, Fei; Sadigh, Dorsa (May 2024, International Conference on Robotics and Automation (ICRA))

Full Text Available
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections

Zha, Lihan; Cui, Yuchen; Lin, Li-Heng; Kwon, Minae; Gonzalez, Montserrat; Zeng, Andy; Xia, Fei; Sadigh, Dorsa (May 2024, International Conference on Robotics and Automation (ICRA))

Full Text Available
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections

Zha, Lihan; Cui, Yuchen; Lin, Li-Heng; Kwon, Minae; Gonzalez-Arenas, Montserrat; Zeng, Andy; Xia, Fei; Sadigh, Dorsa (May 2024, International Conference on Robotics and Automation (ICRA))

Full Text Available
Rearrangement Planning for General Part Assembly

Li, Yulong; Zeng, Andy; Song, Shuran (November 2023, Proceedings of the 7th Conference on Robot Learning)
Tan, Jie; Toussaint, Marc; Darvish, Kourosh (Ed.)
Most successes in autonomous robotic assembly have been restricted to single target or category. We propose to investigate general part assembly, the task of creating novel target assemblies with unseen part shapes. As a fundamental step to a general part assembly system, we tackle the task of determining the precise poses of the parts in the target assembly, which we term “rearrangement planning". We present General Part Assembly Transformer (GPAT), a transformer-based model architecture that accurately predicts part poses by inferring how each part shape corresponds to the target shape. Our experiments on both 3D CAD models and real-world scans demonstrate GPAT’s generalization abilities to novel and diverse target and part shapes.
more » « less
Full Text Available
TidyBot: personalized robot assistance with large language models

https://doi.org/10.1007/s10514-023-10139-z

Wu, Jimmy; Antonova, Rika; Kan, Adam; Lepert, Marion; Zeng, Andy; Song, Shuran; Bohg, Jeannette; Rusinkiewicz, Szymon; Funkhouser, Thomas (December 2023, Autonomous Robots)

Full Text Available
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning

Zakka, Kevin; Wu, Philipp; Smith, Laura; Gileadi, Nimrod; Howell, Taylor; Peng, Xue Bin; Singh, Sumeet; Tassa, Yuval; Florence, Pete; Zeng, Andy; et al (November 2023, Conference on Robot Learning)

Replicating human-like dexterity in robot hands represents one of the largest open problems in robotics. Reinforcement learning is a promising approach that has achieved impressive progress in the last few years; however, the class of problems it has typically addressed corresponds to a rather narrow definition of dexterity as compared to human capabilities. To address this gap, we investigate piano-playing, a skill that challenges even the human limits of dexterity, as a means to test high-dimensional control, and which requires high spatial and temporal precision, and complex finger coordination and planning. We introduce RoboPianist, a system that enables simulated anthropomorphic hands to learn an extensive repertoire of 150 piano pieces where traditional model-based optimization struggles. We additionally introduce an open-sourced environment, benchmark of tasks, interpretable evaluation metrics, and open challenges for future study.
more » « less
Full Text Available
Algorithms and Systems for Manipulating Multiple Objects

https://doi.org/10.1109/TRO.2022.3197013

Pan, Zherong; Zeng, Andy; Li, Yunzhu; Yu, Jingjin; Hauser, Kris (February 2023, IEEE Transactions on Robotics)

Full Text Available
Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

Ren, Allen Z; Dixit, Anushri; Bodrova, Alexandra; Singh, Sumeet; Tu, Stephen; Brown, Noah; Xu, Pen; Takayama, Leila Takayama; Xia, Fei; Varley, Jake; et al (November 2023, Conference on Robot Learning (CoRL))

Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, which is a framework for measuring and aligning the uncertainty of LLM-based planners such that they know when they don't know and ask for help when needed. KnowNo builds on the theory of conformal prediction to provide statistical guarantees on task completion while minimizing human help in complex multi-step planning settings. Experiments across a variety of simulated and real robot setups that involve tasks with different modes of ambiguity (e.g., from spatial to numeric uncertainties, from human preferences to Winograd schemas) show that KnowNo performs favorably over modern baselines (which may involve ensembles or extensive prompt tuning) in terms of improving efficiency and autonomy, while providing formal assurances. KnowNo can be used with LLMs out of the box without model-finetuning, and suggests a promising lightweight approach to modeling uncertainty that can complement and scale with the growing capabilities of foundation models.
more » « less
Full Text Available

Search for: All records